Avian Hepatitis E Virus, Vaccines and Methods of Protecting 
Against Avian Hepatitis-Splenomegaly Syndrome 
and Mammalian Hepatitis E 



CROSS-REFERENCE TO RELATED U.S. APPLICATIONS 
This application claims the benefit under 35 U.S.C. § 119(e) of U.S. Provisional
Application No. 60/259,846, filed January 5, 2001.

STATEMENT REGARDING FEDERALLY SPONSORED
RESEARCH OR DEVELOPMENT

The project resulting in the present invention has been supported in part by grants
from the National Institutes of Health (AI01653-01, AI46505-01).




REFERENCE TO A "Sequence Listing" 

The material on a single compact disc containing a Sequence Listing file provided 
in this application is incorporated by reference. The date of creation is September 9,
2002 and the size is approximately 21.4 KB. 



BACKGROUND OF THE INVENTION 



Field of the Invention 



The present invention concerns a novel avian hepatitis E virus, immunogenic
compositions, diagnostic reagents, vaccines and methods of detecting or protecting
against avian hepatitis-splenomegaly syndrome and mammalian hepatitis E. 
causative agent and diagnosing hepatitis E in a mammal or hepatitis-splenomegaly
syndrome in an avian species which employ the nucleotide sequence described herein, 
antibodies raised or produced against the immunogenic compositions or antigens (such as 
ORF2, ORF3, etc.) expressed in a baculovirus vector, E. coli and the like. The invention
further embraces methods for detecting avian HEV nucleic acid sequences in an avian or
mammalian species using nucleic acid hybridization probes or oligonucleotide primers 
for polymerase chain reaction (PCR). 

BRIEF DESCRIPTION OF THE DRAWINGS 
The background of the invention and its departure from the art will be further
described hereinbelow with reference to the accompanying drawings, wherein:

Figure 1 shows the amplification of the 3' half of the avian HEV genome by RT-PCR:
Lane M, 1 kb ladder; Lanes 1 and 2, PCR with AmpliTaq Gold polymerase; Lanes 3
and 4, PCR with AmpliTaq Gold polymerase in the presence of 5% v/v dimethyl sulfoxide
(hereinafter referred to as "DMSO"); Lanes 5 and 6, PCR amplification with a mixture of
Taq polymerase and pfu contained in an eLONGase® Kit (GIBCO-BRL, Gaithersburg,
MD).

Figures 2A and 2B represent the amino acid sequence alignment of the putative 
RNA-dependent RNA polymerase (RdRp) gene of avian HEV (which corresponds to 
SEQ ID NO:4) with that of known HEV strains. The conserved GDD motif is
underlined. The sequence of the prototype Burmese strain is shown on top, and only 
differences are indicated. Deletions are indicated by hyphens (-). 

Figures 3A-3C represent the sequence alignment of the ORFs 1, 2 and 3 
overlapping region. The sequence of the prototype Burmese strain is shown on top, and 
only differences are indicated in other HEV strains. The sequence of avian HEV (which
corresponds to SEQ ID NO:12) is shown at the bottom. The start codons are indicated by
arrows, and the stop codons are indicated by three asterisks (***). The two PCR primers 
(FdelAHEV and RdelAHEV) used to amplify the region flanking the deletions are 
indicated. Deletions are indicated by hyphens (-). 
Figure 4 shows the hydropathy plot of the putative ORF2 protein of avian HEV.
A highly hydrophobic domain is identified at the N-terminus of the protein followed by a
hydrophilic region. The hydrophobic domain is the putative signal peptide of ORF2.
The horizontal scale indicates the relative position of amino acid residues of the ORF2. 

Figures 5A-5C represent the amino acid sequence alignment of the putative 
capsid gene (ORF2) of avian HEV (which corresponds to SEQ ID NO:6) with that of 
known HEV strains. The putative signal peptide sequence is highlighted, and the 
predicted cleavage site is indicated by arrowheads. The N-linked glycosylation sites are 
underlined in boldface. The sequence of the prototype Burmese strain is shown on top, 
and only differences are indicated in other HEV strains. The conserved tetrapeptide 
APLT is indicated (asterisks). Deletions are indicated by hyphens (-). 

Figure 6 illustrates the sequence alignments of the 3' noncoding region (NCR) of
avian HEV (which corresponds to SEQ ID NO:13) with that of known HEV strains. The
3' NCR of avian HEV is shown on top, and only differences are indicated in other HEV
strains. Deletions are indicated by hyphens (-).



Figure 7 represents the RT-PCR amplification of the avian HEV genomic region 
with a major deletion: Lane M, 1 kb ladder; Lanes 1 and 2, PCR amplification without 
DMSO; Lane 3, PCR amplification in the presence of 5% v/v DMSO; Lane 4, PCR 
amplification in the presence of 5% v/v formamide. 

Figures 8A-8C provide phylogenetic trees based on the sequences of different 
genomic regions of HEV wherein Fig. 8A is a 439 bp sequence of the helicase gene, Fig.
8B is a 196 bp sequence of the RNA-dependent RNA polymerase gene and Fig. 8C is a 
148 bp sequence of the ORF2 gene. The sequences in the three selected regions are 
available for most HEV strains. 




Figures 9A-9C represent the entire 4 kb nucleotide sequence (3931 bp plus
poly (A) tract at 3' end) of the avian hepatitis E virus (which corresponds to SEQ ID
NO:1).



Figure 10 represents the predicted amino acid sequence of the protein encoded by 
the helicase gene (which corresponds to SEQ ID NO:2).

Figure 11 represents the nucleotide sequence (439 bp) of the helicase gene (which 
corresponds to SEQ ID NO:3). 

Figure 12 represents the predicted amino acid sequence of the protein encoded by 
the RdRp gene (which corresponds to SEQ ID NO:4). 
Table 1. Synthetic oligonucleotide primers used for PCR amplification and DNA
sequencing of the avian HEV genome

Primer Designation  Nucleotide Sequence (5' to 3')                Position(a)

Sequencing Primers  CAATCTCGACCAGCACCCCACCAA (SEQ ID NO:14)       407-384
                    CCGGGAGCGCTGTAGTGTGATTGATGT                   358-384
                    ACAGGCCCGGGTGGATTTATGG (SEQ ID NO:15)         618-597
                    CAATCAACCCCTCAACACTGGA                        840-861
                    GTGCAACAGGGTCATCCAGCGTAAAT (SEQ ID NO:16)     1007-982
                    GGATGCCCGATTGGATGGTAGCCTT                     1275-1299
                    AAGGCTACCATCCAATCGGGCATCC (SEQ ID NO:17)      1299-1275
                    TCCCGGGAGCTGGTGTTGTCTGC                       1602-1624
                    GATGCCCGATTGGATGGTAGCCTTGTA                   1276-1302
                    ATGTCGGGCCCCCAGTTCTTGTCAG (SEQ ID NO:18)      1677-1653
                    CAATGTGCTGCGGGGTGTCAAG                        2015-2036
                    CCCTTGACACCCCGCAGCACATT (SEQ ID NO:19)        2038-2016
                    TATAGAGAAGCCGCCCACCGCATTTG (SEQ ID NO:20)     2439-2414
                    GACCAATTTCGCCATCCTCAGCAGT (SEQ ID NO:21)      2914-2890
                    ACCGACATATACAGTTTCACCTCAG (SEQ ID NO:22)      3065-3041
                    CTGAGGTGAAACTGTATATGTCGGT                     3041-3065
                    GAACGGCGAGCCTGAGGTGAAACTGT                    3030-3055
                    CAATAGGCCATGCTTATAGAGAA (SEQ ID NO:23)        2453-2431
                    GCATACCAAACCACGGAGCTACCATTCTG (SEQ ID NO:24)  3572-3544
                    TCTTCAGAATGGTAGCTCCGTGGTTTG                   3540-3566
F4AHEV              GCTAGGCGACCCGCACCAGAT                         non-viral (GIBCO)
AP                  GACTCGAGTCGACATCGA(T)17                       non-viral (GIBCO)
PAUP                GACTCGAGTCGACATCGA                            non-viral (GIBCO)
FdelAHEV            GGGGCCCGACATTCAGCGGATGCAG                     1666-1690
RdelAHEV            GCCGCGGTGACAACGTCTGTGAGAGG (SEQ ID NO:25)     2168-2143

(a) The positions are relative to the 3931 bp sequence of avian HEV (corresponding to SEQ
ID NO:1) determined in the present invention.
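
The strand orientation of each primer in Table 1 can be read from its position: ascending coordinates indicate a sense primer and descending coordinates an antisense primer, and the primer length should equal the span of its coordinates. The following Python sketch is an illustration only and is not part of the original disclosure; the function names are hypothetical.

```python
# Illustrative helpers (hypothetical names, not from the original text).

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def orientation(position: str) -> str:
    """Classify a primer as sense or antisense from a 'start-end' string."""
    start, end = (int(x) for x in position.split("-"))
    return "sense" if start < end else "antisense"


def span_length(position: str) -> int:
    """Number of nucleotides covered by a 'start-end' position string."""
    start, end = (int(x) for x in position.split("-"))
    return abs(end - start) + 1


# Three primers copied from Table 1 as (sequence, position).
primers = [
    ("GGATGCCCGATTGGATGGTAGCCTT", "1275-1299"),
    ("AAGGCTACCATCCAATCGGGCATCC", "1299-1275"),
    ("GCCGCGGTGACAACGTCTGTGAGAGG", "2168-2143"),  # RdelAHEV
]

for seq, pos in primers:
    # The primer length should match the span implied by its coordinates.
    print(pos, orientation(pos), len(seq) == span_length(pos))

# The primers at 1275-1299 and 1299-1275 cover one region on opposite strands.
print(reverse_complement(primers[0][0]) == primers[1][0])
```

A check of this kind is useful when a primer table has been transcribed by hand, since a mismatch between primer length and coordinate span flags a likely transcription error.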
were selected and sequenced for both DNA strands. The number of poly (A) residues at
the 3' end of each of the three cDNA clones was different (19, 23, and 26 residues, 
respectively), indicating that these 3 clones sequenced represent independent cDNA 
clones. This 4 kb genomic fragment contains the complete ORFs 2 and 3 (set forth in 
SEQ ID NO:7 and SEQ ID NO:9, respectively), the complete RNA-dependent RNA 
polymerase (RdRp) gene (set forth in SEQ ID NO:5), a partial helicase gene of the ORF1 
(set forth in SEQ ID NO:3), and the complete 3' noncoding region (NCR) (set forth in
SEQ ID NO:13).



EXAMPLE 8

Sequence Analysis of the ORF1 Region 
The sequences of the three independent cDNA clones have the same size but 
differ in 16 nucleotide positions. However, at any given position, two of the three cDNA 
clones have the same nucleotide. Therefore, a consensus sequence was produced. The 
resulting consensus sequence of the 3' half genomic fragment of avian HEV is 3,931
nucleotides in length, excluding the poly (A) tract at the 3' end and the sequence of the 5'
sense primer used for amplification. Sequence analysis revealed that the novel virus
associated with HS in chickens is genetically related to human and swine HEV. Two
complete ORFs (ORFs 2 and 3), and one incomplete ORF1 were identified in this
genomic region.
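
The consensus step described above, in which two of the three clones agree at every variable position, amounts to a per-position majority vote. The sketch below illustrates that idea in Python under the stated assumption that all clones have the same length; the function name and the toy sequences are hypothetical and are not taken from the avian HEV clones.

```python
from collections import Counter
from typing import List


def majority_consensus(clones: List[str]) -> str:
    """Build a consensus by taking the majority base at each position.

    Assumes the clone sequences are of equal length and already in register,
    and that a strict majority exists at every position.
    """
    if len({len(c) for c in clones}) != 1:
        raise ValueError("clones must have identical lengths")
    consensus = []
    for column in zip(*clones):
        base, count = Counter(column).most_common(1)[0]
        if count <= len(clones) // 2:
            raise ValueError("no majority base at a position")
        consensus.append(base)
    return "".join(consensus)


# Toy sequences (hypothetical): each differs from the others at one position.
clones = ["ACGTTAGC", "ACGATAGC", "ACGTTAGG"]
print(majority_consensus(clones))
```

With three clones, the vote is decisive whenever at least two clones agree, which is the situation reported for all 16 variable positions.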

The incomplete ORF1 sequence of avian HEV was aligned with the
corresponding regions of human and swine HEV strains. Significant nucleotide and
amino acid sequence identities were found in the ORF1 region between avian HEV and
known HEV strains (Table 2, below). The avian HEV ORF1 region sequenced thus far
contained the complete RdRp gene and a partial helicase gene. The RdRp gene of avian
HEV encodes 483 amino acid residues and terminates at the stop codon of ORF1. A
GDD motif (positions 343-345 in RdRp gene) that is believed to be critical for viral
replication was identified (Figs. 2A-2B corresponding to SEQ ID NO:4), and this motif
was found in all RdRps (G. Kamer et al., "Primary structural comparison of RNA-dependent
polymerases from plant, animal and bacterial viruses," Nucleic Acids Res.
12:7269-7282 (1984)). The RdRp gene of avian HEV is 4 amino acid residues shorter
than that of known HEV strains (Figs. 2A-2B corresponding to SEQ ID NO:4), and
shared 47% to 50% amino acid and 52% to 53% nucleotide sequence identity with that of
known HEV strains (Table 2, below). The C-terminal 146 amino acid residues of the
incomplete helicase gene of avian HEV shared approximately 57-60% nucleotide
sequence and 58-60% amino acid sequence identities with the corresponding region of
other HEV strains. The helicase gene of avian HEV is the most conserved region
compared to known HEV strains. There is no deletion or insertion in this partial helicase
gene region between avian HEV and other HEV strains. A 439 bp sequence of BLSV is
available in the helicase gene region (C. J. Payne et al., 1999, supra), and avian HEV
shared 80% nucleotide sequence identity with BLSV in this region.
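
Percent identity figures such as those above are computed from aligned sequences. The text does not state how gapped columns were treated, so the following Python sketch adopts one common convention as an assumption: columns in which either sequence has a gap are excluded from the comparison. The function name and the toy alignment are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns in which neither sequence has a gap.

    Assumption: gapped columns are skipped. Other conventions (for example,
    counting gaps as mismatches) give different values.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" or y == "-":
            continue  # skip columns containing a gap
        compared += 1
        matches += x == y
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy alignment (hypothetical): 9 ungapped columns, 8 of them identical.
print(round(percent_identity("ACGT-ACGTA", "ACGTTACCTA"), 1))
```

The same function applies to nucleotide and amino acid alignments, since it only compares characters column by column.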



EXAMPLE 9
Sequence Analysis of the ORFs 2 and 3 
The ORF2 gene of avian HEV consists of 1,821 nucleotides with a coding 
capacity of 606 amino acids, about 60 amino acids shorter than that of other HEV strains. 
The ORF2 gene of avian HEV overlaps with ORF3 (Figs. 3A-3C corresponding to SEQ
ID NO:12), and terminates at stop codon UAA located 130 bases upstream of the poly (A)
tract. The predicted amino acid sequence of ORF2 contains a typical signal peptide at its
N-terminus followed by a hydrophilic domain (Fig. 4). The sequence of the avian HEV
signal peptide is distinct from that of known HEV strains (Figs. 5A-5C corresponding to
SEQ ID NO:6). However, it contains common signal peptide features that are necessary
for the translocation of the peptide into the endoplasmic reticulum: a positively charged
amino acid (arginine) at its N-terminus, a core of highly hydrophobic region (rich in
leucine residues) and a cleavage site (SRG-SQ) between positions 19 and 20 (Figs. 5A-5C
corresponding to SEQ ID NO:6). Sequence analysis of the ORF2 revealed that the
region between the signal peptide and the conserved tetrapeptide APLT (positions 108-111)
is hypervariable, and 54 amino acid residues of avian HEV are deleted in this region
(Figs. 5A-5C corresponding to SEQ ID NO:6). Three putative N-linked glycosylation
sites were identified in the ORF2 of avian HEV: NLS (pos. 255-257), NST (pos. 510-512)
and NGS (pos. 522-524). Three N-linked glycosylation sites were also identified in
known HEV strains but the locations are different from those of avian HEV. The first
glycosylation site in known HEV strains is absent in avian HEV (Figs. 5A-5C
corresponding to SEQ ID NO:6), and the third glycosylation site in avian HEV is absent
in the known HEV strains. 
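
Putative N-linked glycosylation sites such as NLS, NST and NGS follow the general sequon pattern N-X-S/T. As an illustration only, the Python sketch below scans a protein sequence for that pattern, assuming the usual convention that X may be any residue except proline (the text itself does not state this exclusion). The function name and the toy peptide are hypothetical; the peptide is not the avian HEV ORF2 sequence.

```python
import re
from typing import List, Tuple

# N-X-S/T sequon; the lookahead lets overlapping sites be reported.
# Assumption: X is any residue except proline.
SEQUON = re.compile(r"(?=(N[^P][ST]))")


def find_sequons(protein: str) -> List[Tuple[int, str]]:
    """Return (1-based start position, tripeptide) for each putative site."""
    return [(m.start() + 1, m.group(1)) for m in SEQUON.finditer(protein.upper())]


# Toy peptide (hypothetical). The NPT at position 8 is skipped because of proline.
toy = "MANLSGGNPTAANSTQNGS"
print(find_sequons(toy))  # [(3, 'NLS'), (13, 'NST'), (17, 'NGS')]
```

A match only marks a candidate site; whether a given asparagine is actually glycosylated has to be determined experimentally.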

The ORF2 gene of known HEV strains varies slightly in size, ranging from 655 to 
672 amino acid residues, but most strains have an ORF2 gene of 660 amino acid residues.
The ORF2 of avian HEV has 606 amino acid residues, which is 54 amino acids shorter
than that of most known HEV strains. The deletions are largely due to the shift of the
ORF2 start codon of avian HEV to 80 nucleotides downstream from that of known HEV
strains (Figs. 3A-3C corresponding to SEQ ID NO:12). The putative capsid gene (ORF2)
of avian HEV shared only 42% to 44% amino acid sequence identity with that of known
HEV strains (Table 3, below), when the major deletion at the N-terminus is taken into
consideration. However, when the N-terminal deletion is not included in the comparison,
avian HEV shared 48% to 49% amino acid sequence identity with the corresponding 
region of other HEV strains. 

Multiple sequence alignment revealed that the normal start codon of the ORF3 
gene in known HEV strains does not exist in avian HEV due to base substitutions (Figs.
3A-3C corresponding to SEQ ID NO:12). Avian HEV utilizes the ORF2 start codon of
other HEV strains for its ORF3, and consequently the ORF3 of avian HEV starts 41 
nucleotides downstream from the start codon of known HEV strains (Figs. 3A-3C 
corresponding to SEQ ID NO:12). Unlike known HEV strains, the ORF3 gene of avian
HEV does not overlap with the ORF1 and is located 33 bases downstream from the ORF1
stop codon (Figs. 3A-3C corresponding to SEQ ID NO:12). The ORF3 of avian HEV
consists of 264 nucleotides with a coding capacity of 87 amino acid residues, which is 24
to 37 amino acid residues shorter than that of known HEV strains. Sequence analysis
indicated that the ORF3 of avian HEV is very divergent compared to that of known HEV
strains.
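
The nucleotide lengths and coding capacities given for ORF2 and ORF3 are mutually consistent if the stated nucleotide counts include the stop codon. That reading is an assumption inferred from the numbers, and the short Python sketch below, with a hypothetical function name, simply makes the arithmetic explicit.

```python
def coding_capacity(orf_nucleotides: int) -> int:
    """Amino acids encoded by an ORF whose stated length includes the stop codon."""
    if orf_nucleotides % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    return orf_nucleotides // 3 - 1  # one codon is the stop codon


orf2_aa = coding_capacity(1821)  # ORF2: 1,821 nucleotides
orf3_aa = coding_capacity(264)   # ORF3: 264 nucleotides

print(orf2_aa)        # 606
print(orf3_aa)        # 87
print(660 - orf2_aa)  # 54 residues shorter than the 660-residue ORF2 of most strains
```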
EXAMPLE 10
Sequence Analysis of the 3' NCRs 
The region between the stop codon of the ORF2 and the poly (A) tail of avian 
HEV, the 3' NCR, is 127 nucleotides (set forth in SEQ ID NO:13). Sequence analysis
revealed that the 3' NCR of avian HEV is the longest among all known HEV strains. The
3' NCRs of known HEV strains range from 65 to 74 nucleotides (Fig. 6 corresponding to
SEQ ID NO:13). Multiple sequence alignment indicated that the 3' NCRs of HEV are
highly variable, although a stretch of sequence immediately preceding the poly (A) tract
is relatively conserved (Fig. 6 corresponding to SEQ ID NO:13).


EXAMPLE 11
Identification of a Major Deletion in the ORFs 2 and 3 
Overlapping Region of Avian HEV 
Sequence analyses revealed a major deletion of 54 amino acid residues in avian
HEV between the putative signal peptide and the conserved tetrapeptide APLT of the
ORF2 (Figs. 5A-5C corresponding to SEQ ID NO:6). To rule out the possibility of RT-PCR
artifacts, a pair of avian HEV-specific primers flanking the deleted region was
designed (Table 1, Figs. 3A-3C). The 3' antisense primer (RdelAHEV) is located before
the ORF3 stop codon of avian HEV, and the 5' sense primer (FdelAHEV) is located within
the C-terminal region of the ORF1. To minimize potential secondary structure problems,
reverse transcription was performed at 60°C with a One Step RT-PCR Kit (Qiagen Inc., 
Valencia, CA). PCR was performed with 35 cycles of denaturation at 95°C for 40 
seconds, annealing at 55°C for 30 seconds and extension at 72°C for 1 minute. In 
addition, PCR was also performed with a shorter annealing time and a higher denaturation
temperature to avoid potential problems due to secondary structures. The PCR reaction
consisted of an initial enzyme activation step at 95°C for 13 minutes, followed by 35 
cycles of denaturation at 98°C for 20 seconds, annealing at 55°C for 5 seconds and 
extension at 73°C for 1 minute. It has been reported that formamide or DMSO could
enhance the capability of PCR to amplify certain genomic regions of HEV (S. Yin et al.,
"A new Chinese isolate of hepatitis E virus: comparison with strains recovered from
different geographical regions," Virus Genes 9:23-32 (1994)). Therefore, a sufficient
amount to make 5% (v/v) of formamide or DMSO was added in the PCR reactions. A
PCR product of the same size (502 bp) as observed in a conventional PCR was produced
with various different RT-PCR parameters and conditions including the addition of 5%
(v/v) of formamide or DMSO, the use of a higher denaturation temperature and a short
annealing time, and the synthesis of cDNA at 60°C (Fig. 7). The deletion was further
confirmed by directly sequencing the 502 bp PCR product.
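
The two PCR cycling programs described in this example can be written down as structured data, which makes the differences between them easy to compare. The Python sketch below is an illustration only: the class and field names are hypothetical, the reverse transcription step is not modelled, and no activation step is entered for the first program because the text does not state one for it.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

Step = Tuple[int, int]  # (temperature in deg C, hold time in seconds)


@dataclass(frozen=True)
class CyclingProgram:
    """A three-step PCR program (hypothetical structure for illustration)."""
    name: str
    cycles: int
    denature: Step
    anneal: Step
    extend: Step
    activation: Optional[Step] = None  # None when the text states no such step

    def cycling_seconds(self) -> int:
        """Programmed hold time of the cycles only (ramping is ignored)."""
        per_cycle = self.denature[1] + self.anneal[1] + self.extend[1]
        return self.cycles * per_cycle

    def total_seconds(self) -> int:
        """Cycling time plus the activation hold, if one is defined."""
        extra = self.activation[1] if self.activation else 0
        return self.cycling_seconds() + extra


# Values taken from the two programs described in this example.
conventional = CyclingProgram("conventional", 35, (95, 40), (55, 30), (72, 60))
modified = CyclingProgram(
    "high denaturation, short annealing",
    35, (98, 20), (55, 5), (73, 60),
    activation=(95, 13 * 60),
)

for program in (conventional, modified):
    print(program.name, program.cycling_seconds(), program.total_seconds())
```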

EXAMPLE 12 
Phylogenetic Evidence of Avian HEV as a New Genotype
Phylogenetic analyses based on three different genomic regions of HEV (a 439 bp
sequence of the helicase gene, a 196 bp sequence of the RdRp gene, and a 148 bp sequence of the ORF2 gene)
identified at least 5 distinct genotypes of HEV (Fig. 8). The topology of the three trees 
based on different genomic regions is very similar. Similar phylogenetic trees were also 
produced with the complete RdRp and ORF2 genes of HEV strains for which the
sequences are known. Most Asian strains of HEV are related to the prototype Burmese
strain and clustered together, and these Burmese-like Asian strains of HEV represent
genotype 1. The African strains of HEV (Egypt 93, Egypt 94 and Morocco) were related
to, but distinct from, Burmese-like strains in the genotype 1. The limited sequences 
available for these African strains do not allow for a determination of whether they 
represent a distinct genotype or a subgenotype within the genotype 1. The single 
Mexican strain of HEV represents genotype 2. The genotype 3 of HEV consists of two 
U.S. strains of human HEV (US1, US2), a U.S. strain of swine HEV, a New Zealand
strain of swine HEV, and several European strains of human HEV (Greek 1, Greek 2, 
Italy). The genotype 4 includes several strains of HEV identified from patients in China 
(HEV-T1, Ch-T11, Ch-T21, 93G) and Taiwan (TW7E, TW4E, TW8E). Avian HEV is
the most divergent and represents the new genotype 5. Based on the limited sequence 
available for BLSV, it appears that the BLSV identified from chickens in Australia 
clustered with the genotype 5 of avian HEV, but the avian HEV retained significant 
differences in nucleotide sequence indicating that the avian HEV represents a new and 
distinct viral strain. The phylogenetic evidence thus indicates that avian HEV is the most divergent strain of
HEV identified thus far and represents a new genotype.
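
Phylogenetic trees of this kind are commonly built from pairwise distances between aligned sequences. The text does not name the distance measure or tree-building method that was used, so the Python sketch below is only a generic illustration: it computes the proportion of differing sites and applies the Jukes-Cantor correction, using hypothetical function names and toy sequences that are not HEV data.

```python
import math
from typing import Dict, Tuple


def p_distance(a: str, b: str) -> float:
    """Proportion of differing sites among ungapped aligned columns."""
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    pairs = [(x, y) for x, y in zip(a.upper(), b.upper()) if x != "-" and y != "-"]
    if not pairs:
        raise ValueError("no ungapped columns to compare")
    return sum(x != y for x, y in pairs) / len(pairs)


def jukes_cantor(p: float) -> float:
    """Jukes-Cantor corrected distance; defined only for p < 0.75."""
    if p >= 0.75:
        raise ValueError("distance is saturated (p >= 0.75)")
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


def distance_matrix(seqs: Dict[str, str]) -> Dict[Tuple[str, str], float]:
    """Corrected distance for every unordered pair of named sequences."""
    names = sorted(seqs)
    return {
        (m, n): jukes_cantor(p_distance(seqs[m], seqs[n]))
        for i, m in enumerate(names)
        for n in names[i + 1:]
    }


# Toy aligned sequences (hypothetical labels and data).
toy = {"strainA": "ACGTACGTAC", "strainB": "ACGTACGTAA", "strainC": "ACGAACGTTA"}
for pair, d in distance_matrix(toy).items():
    print(pair, round(d, 3))
```

A matrix of this form is the usual input to distance-based tree-building methods; a strain that is most divergent shows the largest distances to all other strains.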